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ABSTRACT 
Machine learning is a fascinating topic; it's astonishing how a small 


change in the evaluation values may result in an unfathomable Pal "Skin Cancer — =_ 
eal-Time" 


number of outcomes. The goal of this study is to develop a model 
that uses image processing to identify skin cancer. We will later use 
the model in real-life through an android application. 
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INTRODUCTION 
What is image processing and why are we using it? 


Image processing is a technique for. performing 
Operations on an image in order to improve it or 
extract useful information from it. It's a type of signal 
processing in which the input is an image and the 
output is either an image or its 
characteristics/features. 


Image processing mainly include the following steps: 

1. Importing the image via image acquisition tools; 

2. Analysing and manipulating the image; 

3. Output in which result can be altered image or a 
report which 1s based on analysing that image. 


Image as a Matrix 

As we know, images are represented in rows and 
columns. The syntax for representing images 1s as 
follows: 


How to cite this paper: Sunami 
Dasgupta | Soham Das | Sayani Hazra 


Image-Processing in 
Published in : 
International Journal 
of Trend in 
Scientific Research 
and Development 
(ajtsrd), ISSN: 2456- 
6470, Volume-5 | 
Issue-6, October 
2021, pp.271-280, URL: 
www.ijtsrd.com/papers/ijtsrd463 84. pdf 





























Copyright © 2021 by author(s) and 
International Journal of Trend in 
Scientific Research and Development 
Journal. This is an 
Open Access article 
distributed under the 
terms of the Creative Commons 
Attribution License (CC BY 4.0) 


(http://creativecommons.org/licenses/by/4.0) 





(0,0) (0,1) f(0,2) f(0,N-1) 
fix.y) = ° 1,0) : 1,1) . 1,2) oa i 1,N-1) 

#(M-1,0) f(M-1,1) f(M-1,2) f(M-LN-1) 
In MATLAB: 

Fii, i) FCT, 2) Fil, WN) 
f fiz, 1) Tle, 2) Ti(2, N) 

FiM, 1) T(M, 2) Fil, MN) 


Phases of Image Processing 

There are different phases to import and extract data 
from an Image. It consists of 10 major steps. They are 
as follows: 
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1. ACQUISITION- It could be as fundamental as 
being given a digital image to work with. The 
main work involves: 

Scaling. 

Colour conversion (RGB to Gray or vice-versa). 


2. IMAGE ENHANCEMENT- It is one of the most 
basic and appealing aspects of Image Processing, 
and it 1s also used to extract some hidden 
elements from an image. It is subjective. 


3. IMAGE RESTORATION - This likewise has to 
do with making an image appealing, but it is more 
objective (Restoration is based on mathematical 
or probabilistic model or image degradation). 


4. COLOR IMAGE PROCESSING-— This section 
covers pseudo-colour and full colour image 
processing, as well as colour models that can be 
used in digital image processing. 


5. WAVELETS AND MULTI-RESOLUTION 
PROCESSING— This is the foundation for 
portraying images in a variety of ways. 


6. IMAGE COMPRESSION—This _ procedure 
necessitates the creation of various functions. It 
primarily concerns image size or resolution. 


7. MORPHOLOGICAL PROCESSING-It deals 
with tools for extracting picture components that 
can be used to represent and describe shape. 


8. SEGMENTATION PROCEDURE -Partitioning 
an image into its constituent sections or objects is 
part of this procedure. The most demanding 
problem in Image Processing is autonomous 
segmentation. 


9. REPRESENTATION & DESCRIPTION-It 
comes after the output of the segmentation step; 
selecting a representation is only one part of the 
solution for converting raw data into processed 
data. 


10. OBJECT DETECTION AND RECOGNITION-It 
is a process that assigns a label to an object based 
on its descriptor. 


7 > 


Overlapping Fields with Image Processing 


OUTPUT 
INPUT 
IMAGE DESCRIPTION 
IMAGE poneoienste : COMPUTER VISION 
DESCRIPTION §f computer GRAPHICS [| ARTIFICIAL INTELLIGENCE 





1. According to block 1, if the input is an image and 
the output is an image, the process is known as 
digital image processing. 


2. According to block 2, Computer Vision is defined 
as an input that is an image and an output that 1s 
some kind of information or description. 


3. According to block 3, if input is some description 
or code and we get image as an output, then it 1s 
termed as Computer Graphics. 


4. According to block 4, if input is description or 
some keywords or some code and we get 
description or some keywords as an output, then it 
is termed as Artificial Intelligence. 


Skin Cancer Overview 

Skin cancer, or the abnormal proliferation of skin 
cells, is most commonly found on sun-exposed skin. 
However, this frequent type of cancer can also 
develop on parts of your skin that aren't normally 
exposed to the sun. 


Basal cell carcinoma, squamous cell carcinoma, and 
melanoma are the three most common kinds of skin 
cancer. 


Limiting or avoiding ultraviolet (UV) radiation 
exposure can lower your risk of skin cancer. Skin 
cancer can be detected at an early stage if you check 
your skin for abnormal changes. Early skin cancer 
detection increases your chances of a successful skin 
cancer treatment. 


1. Basal cell carcinoma - A kind of skin cancer 
known as basal cell carcinoma. Basal cell 
carcinoma starts in the basal cells, which are a 
type of skin cell that creates new skin cells when 
the old ones die. Basal cell carcinoma often 
appears as a slightly transparent bump on the 
skin, though it can take other forms. Basal cell 
carcinoma occurs most often on areas of the skin 
that are exposed to the sun, such as your head and 
neck. 


2. Melanoma — It originates in the cells 
(melanocytes) that create melanin, the pigment 
that gives your skin its colour. It is the most 
dangerous type of skin cancer. Melanoma can 
also develop in the eyes and, in rare cases, inside 
the body, such as the nose or throat. Although the 
specific aetiology of all melanomas is unknown, 
ultraviolet (UV) radiation from the sun, tanning 
lights, and beds increases your risk of acquiring 
melanoma. Melanoma risk can be reduced by 
limiting your exposure to UV light. 


3. Nonmelanoma skin cancer - It encompasses all 
skin cancers that aren't melanoma. Nonmelanoma 
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skin cancer encompasses a number of different 
forms of skin cancer, the most frequent of which 
are basal cell carcinoma and squamous cell 
carcinoma. Treatment for nonmelanoma skin 
cancer is determined by the type of malignancy. 
The most common therapy for skin cancer 1s 
surgery to remove the cancerous cells. 


PREREQUISITES 

1. Dataset for Training. 

2. Dataset for Testing. 

3. Machine learning model (that takes in digital 
Images as input and predicts cancer cells). 


EXPERIMENTATION 


Dataset 
We used the dataset provided by SIIM-ISIC 
Melanoma Classification. 


The images are provided in DICOM format. This can 
be accessed using commonly-available libraries like 
PyDicom, and contains both image and metadata. It is 
a commonly used medical imaging data format. 


Images are also provided in JPEG and TF Record 
format (in the jpeg and TF records directories, 
respectively). Images in TF Record format have been 
resized to a uniform 1024x1024. 


TensorFlow and Keras are used to build and create a machine learning model. 


1. Data Exploration 


We created algorithms and models to classify between benign and malignant skin cancers using Convolutional 


Neural Networks. We use Colab for source code editing. 


import tensorflow as tf 

import keras 

from keras import initializers 

from keras import regularizers 

from keras import constraints 

from keras import backend as K 

from keras.activations import elu 
from keras.optimizers import Adam 
from keras.models import Sequential 


from keras.engine import Layer, InputSpec 


from keras.utils.generic_utils import get_custom_objects 


from keras.callbacks import Callback, EarlyStopping, ReduceLROnPlateau 


from keras.layers import Dense, Conv2D, Flatten, GlobalAveragePooling2D, 


g2D, BatchNormalization, GlobalMaxPooling2D 


Dropout, MaxPoolin 


from keras.preprocessing.image import ImageDataGenerator 


from sklearn.metrics import cohen_kappa_score 


SAVED_MODEL_NAME = ‘effnet_model1B5.h5' 


2. Seeding 


The seed sets the starting number used to generate a sequence of random numbers it ensures that you get the 
same result if you start with that same seed each time you run the same process. 


seed = 1234 

rn.seed( seed) 

np.random.seed(seed) 

tf .random.set_seed(seed) 

os.environ[ 'PYTHONHASHSEED' |] = str(seed) 


t_start = time.time() 


3. Initializing Model 


Model initialization is the process of determining the necessary model parameters such as the basic value, the 
trend value, and the seasonal indices for the selected forecast model. 
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model = build_model() 


Model: "sequential" 


Total params: 2,259,265 
Trainable params: 2,225,153 
Non-trainable params: 34,112 


4. Loading the Data 





We'll use the skin cancer dataset's binary classification of benign and malignant skin cancers. Following that, we 


must label. 
import pandas AA po 


dft=0d.read_csvi’ ../input/j]peq-melanoma-256e756/train.csv’ | 


from sklearn.model: selection inport train_test_split 


train. valid = train_test split(dt. test size=8.2) 


5. Dataset 
Let take a look at our dataset. 


tral 
NT ogo Adin Pate = J el aoe ato Jone!) Cho lenis Hisqnosis 
1 Oe ISIS? 1007 P Sb oe aoe emi! 250) torso new 
15 2 He PAG male 00 lors NEE 
256 SIC tOeS3 58 PoS6507d5 mal 5.0 pper axtremi unknown 
a1 | UA Sa P ermal i I nevu 
PRGAe: «ISI AASaed IPAGASAD i 0.0 Lips i wien 
= a lim 7 9FSo5 P_11?f isl AO. x ii pee 
a259o Jl ssd02 | female C.0 torso unknown 
| fede [= Zid a Hod 4 mel lI OWED PSTLen Une 
44 Pol? eet ST PF 180d) r ee # F Lurker 
27439 ISICL830 756 Pp Feber ya 6 6(temale a) = Unknow 


amign 
anign 





@ IJTSRD | Unique Paper ID —- IJTSRD46384 | Volume—5 | Issue—6 | Sep-Oct 2021 


Page 274 


International Journal of Trend in Scientific Research and Development @ www. itsrd.com eISSN: 2456-6470 


6. Data Processing 
We set up generators to read images from our source folders, then we transformed them to float32 tensors, and 
fed them to our network. 


In our case, we'll change the pixel values to the [0, 1] range before pre-processing the images (all values are now 
in the [0, 255] range). As required by the networks, the input data must be scaled to 224x224 pixels as an input. 


train_generator = train_datagen.flow_from_dataframe( 


dataframe=train, 


directory= ../input/jpeg-melanoma-256x256/train/', 
x_col="image_name’ , 
y_col="target’, 


target_size=(256, 256), 
batch_size=32, 


class_mode='binary' ) 


validation_generator = test_datagen.flow_from_dataframe( 


dataframe=valid, 


directory='../input/jpeg-melanoma-256x256/train/', 
x_col="image_name" , 
y_col="target , 


target_size=(256, 256), 
batch_size=16, 


class_mode=‘binary' ) 


Found 26588 validated image filenames belonging to 2 classes. 


Found 6626 validated image filenames belonging to 2 classes. 





7. Building the Model 
It's as simple as layering a linear classifier on top of the feature extractor with the Hub module. For speed, we 
start with a non-trainable feature extractor. 


feature extractor .KerasLayer (MODULE HANDLE, input shape=IMAGE SIZE+(3,), output 
do fine tuning = False 
F do fine tuning: 
feature extractor.trainable = True 
> layer in base model.layers[-30:]: 
layer.trainable =True 


feature extractor.trainable = False 


¢("Building model with", MODULE HANDLE) 

model = tf.keras. ([ 
feature extractor, 
tf.keras.layers.k tk; 
tf.keras.layers.Dense(512, activation='relu'), 
tf. keras.layers. (rate=0.2), 


tf.keras.layers. (train generator.num classes, activation='softmax', 


kernel regularizer=tf.keras.regularizers.12(0.0001)) 
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Building model with https: //tfhub.dey/google/tf2-preview/mobilenet v2/feature vector 
Model: “sequential_4" 





oe a) UryPe) Param # 


pares reve: 16 (KerasLayer) (None, 128@) 2257984 


flatten_i1 (Flatten) (None, 1288) a 


dense 22 (Dense) (None, 655872 


dropout 11 (Dropout) 


dense 23 (Dense) 


SS SS SS Sm SS a SS a 


2,914,882 
656,898 
2,257,984 


Total params: 
Trainable params: 
Non-trainable params: 


8. Training Model 
We validated each step by training the model with the validation dataset. 


LEARNING RATE 


S.optimizers. (learning rate=LEARNING RATE), 
ay 
KE, POCHS= 
history = model. ( 
train generator, 
steps per epoch=train generator.samples//train generator.batch 
epochs=EPOCHS, 


size, 


validation data=validation generator, 





validation steps=validation generator.samples//validation generator.batch si 
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Model training 


9. Checking Performance 


We plotted the training and validation, accuracy and loss. 
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matplotlib.pylab plt 
numpy np 


acc = history.history['accuracy'] 
val _ acc = history.history['val accuracy'] 


loss = history.history['loss'] 


val loss = history.history['val_ loss’] 


(=) ole lod oka wan ar belo (— Mmm ar- tele (—6 G1 -10)03 fp) 


jo) hop ke bha-s Gan kets A-- 8 CROPS DB) 

PLC subplot tl. 2...) 

plt.plot(epochs range, acc, label='Training Accuracy') 
plt.plot(epochs range, val_acc, label='Validation Accuracy') 
plt.legend(loc='"lower right") 

plt.title('Training and Validation Accuracy') 
plt.ylabel("Accuracy (training and validation) ") 
plt.xlabel("Training Steps") 


BLU SubploL tl, 2,72) 

plt.plot(epochs range, loss, label='Training Loss') 
plt.plot (epochs range, val _ loss, label='Validation Loss") 
jo) ome R=10(-selod @ Relea Ab] e)e\-s ame an ke selene 

plt.title('Training and Validation Loss') 
plt.ylabel("Loss (training and validation) ") 
plt.xlabel("Training Steps") 








pit. show () 
The Graph: 
Training and Validation Accuracy Training and Validation Loss 
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10. Testing Model 


We picked five images at random from the validation dataset and made the following prediction: 


random 
cv2 
upload (filename) : 
img = cv2.imread(os.path.join(train dir, filename) ) 
img = cv2.resize(img, ( e be) 
img = img / 
img 


pre result (image): 
xX = model.predict (np.asarray([img])) [09] 
classx = np.argmax (x) 

{Labels[classx]: x[{classx]} 


images = random.sample(validation generator.filenames, 


for idx, filename =Selblit=par- Ree iT le (= Fo) 
shite ME mnblenRel-(ou Gum K-ser-11(=9) 
prediction = pre result (img) 
print("class: %s, confidence: $f" % (list(prediction.keys()) [0], list (prediction 
plt.imshow (img) 
jo) Ropp meke ibba-a @ked.@) 
pit.show () 
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11. Confusion Matrix 
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12. Export Model and converting it to TFlite 


converter = tf.lite.TFLiteConverter.from_keras_model(model) 
tflite_model = converter.convert() 


# Save the model. 
with open('model.tflite', ‘wb') as f: 
f .write(tflite_model) 


SOURCE: class: malignant, file: msalignant/18. jpe SOURCE: class: malignant, file: malignant/10. jpeg 
PREDICTED: class: Malignant, confidence: 6.913409 PREDICTED: class: Malignant, confidence: 6.976365 
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IMPLEMENTATION 


We Later converted the model to TF lite format to 
implement on some real-life imaging. We used the 


app on an android app that we made using Java. 


Out Look of Our App: 
Onco SkinCheck 


Skin Cancer 


Tnaeis bution belay 


| De ol nite { 
yor wee Baler el i your ieee 


Consull vat ou diestor 


MOTE: This application runs on APISO go if your appe shows 
any problem JUST RESTART THE APE 


Developed By Sumani Dasgupta 


cs a 





Our App while its working (Cancer Cell detected): 
Onco SkinCheck 
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When Cell is benign: 
Onco SkinCheck 


Tif akin fella are ae | 





The app was detecting Cancer cells at a 97% 
accuracy. 


CONCLUSION 

Looking back on this project, the overall outcome of 
results to be observed. This can be evaluated by 
looking at how well our objectives were met. We 
have successfully constructed our learning model that 
can predict the cancer cells by image processing. The 
performance at the training phase does not give the 
complete picture, we have to test it on real-data. So 
we later introduced it to an android application. This 
passion project helped us to get the In-depth 
knowledge and experience on machine learning and 
image processing. 
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